adding the API contracts by MikeLippincott · Pull Request #7 · WayScience/ZedProfiler

MikeLippincott · 2026-04-20T17:54:02Z

Description

This PR adds data contracts for validating inputs and outputs. This is an initial step and will be added to as other modules get added in the future.

What kind of change(s) are included?

Documentation (changes docs or other related content).
Bug fix (fixes an issue).
Enhancement (adds functionality).
Breaking change (these changes would cause existing functionality to not work as expected).

Checklist

Please ensure that all boxes are checked before indicating that this pull request is ready for review.

I have read and followed the CONTRIBUTING.md guidelines.
I have searched for existing content to ensure this is not a duplicate.
I have performed a self-review of these additions (including spelling, grammar, and related).
These changes pass all pre-commit checks.
I have added comments to my code to help provide understanding.
I have added a test which covers the code changes found within this PR.
I have deleted all non-relevant text in this pull request template.

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

d33bs

Nice job! I'm requesting changes mainly because I think there's existing tooling you could make use of to reduce the amount of new code you need to create.

If you feel strongly about keeping things as-is just let me know and I'll circle back to give this more thought.

d33bs · 2026-04-23T16:20:54Z

 # ZedProfiler

-[![Coverage](https://img.shields.io/badge/coverage-87%25-green)](#quality-gates)
+[![Coverage](https://img.shields.io/badge/coverage-99%25-brightgreen)](#quality-gates)


How could we validate this coverage? At the moment the badge appears static, meaning it has to be updated by hand whenever coverage changes. If you have a reproducible method which updates it, consider documenting it under contributing.md. Otherwise, consider removing it altogether: part of the point of code coverage is building trust and reproducibility, and a measurement supplied without any validation works directly against both. A side effect of doing this is that it should increase development velocity, because the badge can no longer go stale when coverage changes.
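One reproducible option, sketched here under assumptions, is to derive the badge from the Cobertura-style XML report that `coverage xml` writes, whose root element carries a `line-rate` attribute. The function name `badge_url_from_coverage_xml` and the color thresholds are hypothetical and not part of this PR.

```python
import xml.etree.ElementTree as ET


def badge_url_from_coverage_xml(xml_text: str) -> str:
    """Build a shields.io static badge URL from coverage.py XML report text.

    Assumes the Cobertura-style report written by `coverage xml`, whose
    root <coverage> element has a `line-rate` attribute between 0 and 1.
    """
    root = ET.fromstring(xml_text)
    percent = round(float(root.attrib["line-rate"]) * 100)
    # Color thresholds are illustrative assumptions, not project policy.
    if percent >= 95:
        color = "brightgreen"
    elif percent >= 80:
        color = "green"
    else:
        color = "orange"
    # "%25" is the URL-encoded percent sign, as in the README badge.
    return f"https://img.shields.io/badge/coverage-{percent}%25-{color}"


# A trimmed-down stand-in for a real coverage.xml file.
sample = '<coverage line-rate="0.99" branch-rate="0"></coverage>'
print(badge_url_from_coverage_xml(sample))
```

A script like this could run in CI or a pre-commit hook and fail when the README badge no longer matches the measured value, which would make the number verifiable rather than hand-maintained.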

d33bs · 2026-04-23T16:24:12Z

-The package accepts either:
+The package accepts:
 - Single-channel 3D arrays shaped (z, y, x)
- Multi-channel 4D arrays shaped (c, z, y, x)


What happened to 4d arrays?

d33bs · 2026-04-23T16:27:22Z

+EXPECTED_SPATIAL_DIMS = 3
+TWO_DIMENSIONAL = 2
+FOUR_DIMENSIONAL = 4
+FIVE_OR_MORE_DIMENSIONS = 5
+REQUIRED_RETURN_KEYS = ("image_array", "features", "metadata")


Consider using Pydantic to help define classes which may be used for validation broadly. You could also likely incorporate Pandera, if DataFrames are to be included somehow, which integrates nicely with Pydantic. Doing this will save you effort from building bespoke data / contract validation tooling.

As you go through this effort you can use concepts from object-oriented programming (OOP) to help you modularize what is being validated and worked on at an atomic level within the software. For instance, what do these attributes pertain to? Maybe it's "widget_x". You can label it as such, using defined objects which are both validated and passed by type to other portions of the software. You might eventually find that you also want "widget_y" with separate attributes too. Keep separation of concerns in mind and, at the same time, avoid over-specific OOP ("super good enough" is often best).
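A minimal sketch of what this could look like, assuming Pydantic v2. The class names `ImageShape` and `FeaturizerResult`, the field types, and the positive-size rule are hypothetical illustrations; only the three-dimension rule and the three key names come from the constants in the diff.

```python
from typing import Any

from pydantic import BaseModel, ValidationError, field_validator


class ImageShape(BaseModel):
    """Hypothetical contract for the shape of one input volume."""

    shape: tuple[int, ...]

    @field_validator("shape")
    @classmethod
    def must_be_three_dimensional(cls, value: tuple[int, ...]) -> tuple[int, ...]:
        # Mirrors EXPECTED_SPATIAL_DIMS = 3 from the diff: (z, y, x) only.
        if len(value) != 3:
            raise ValueError(f"expected a (z, y, x) shape, got {len(value)} dims")
        # Assumption for illustration: empty axes are rejected.
        if any(size <= 0 for size in value):
            raise ValueError("every axis must have a positive size")
        return value


class FeaturizerResult(BaseModel):
    """Hypothetical stand-in for the REQUIRED_RETURN_KEYS check.

    All three fields are required, so a missing key fails validation.
    The field types are assumptions, not taken from the PR.
    """

    image_array: Any
    features: dict[str, float]
    metadata: dict[str, Any]


valid = ImageShape(shape=(8, 64, 64))

try:
    ImageShape(shape=(64, 64))  # 2D input is rejected
except ValidationError as err:
    print(f"{err.error_count()} validation error for a 2D shape")
```

Because the validated object is itself a type, downstream functions can accept `ImageShape` or `FeaturizerResult` in their signatures instead of re-checking raw tuples and dictionaries.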

d33bs · 2026-04-23T16:39:43Z

@@ -1,31 +1,251 @@
 """Core data contracts shared across featurizers.


I strongly encourage greater specificity here about "what kind" of contract this is, now that you're building this out. Specifically, design by contract, Hoare triples, and abstract data types come to mind.
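To make the terminology concrete, here is a small sketch of a Hoare triple {P} C {Q} written as explicit precondition and postcondition checks, which is the core idea of design by contract. The function `normalize_intensity` is hypothetical and is not taken from this PR.

```python
def normalize_intensity(values: list[float]) -> list[float]:
    """Scale values to the range [0, 1].

    Precondition  {P}: values is non-empty and not constant.
    Command       {C}: min-max scaling.
    Postcondition {Q}: same length, minimum 0.0, maximum 1.0.
    """
    # {P}: reject inputs for which the command is not defined.
    if not values:
        raise ValueError("precondition failed: values must be non-empty")
    low, high = min(values), max(values)
    if low == high:
        raise ValueError("precondition failed: values must not be constant")

    # {C}: the computation itself.
    scaled = [(v - low) / (high - low) for v in values]

    # {Q}: guarantees the caller may rely on.
    assert len(scaled) == len(values)
    assert min(scaled) == 0.0 and max(scaled) == 1.0
    return scaled


print(normalize_intensity([2.0, 4.0, 6.0]))
```

Stating in the module docstring which of these the contracts enforce (preconditions on inputs, postconditions on return values, or invariants of a data type) would tell readers what a passing check actually guarantees.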

d33bs · 2026-04-23T16:42:43Z

+    return True
+
+
+def validate_return_schema_contract(


I'm unsure if this comes into play here exactly, but this came to mind: Consider using beartype to help validate objects at time of use.

d33bs · 2026-04-23T16:48:12Z

+TEST_DIR = Path(__file__).parent
+sys.path.insert(0, str(TEST_DIR))


This looks like an antipattern which arose from how things are executed for tests. Consider finding a way to remove this code. Sometimes adding an __init__.py file can help, but it depends on what happened that made you add this in (I can't tell from just the code). Pytest can sometimes be a bit cranky about internal imports unless you explicitly tell it that the test dir is a package by adding the __init__.py file.

d33bs · 2026-04-23T16:50:02Z

+# ============================================================================
+# FIXTURE: Minimal valid profile
+# ============================================================================


Consider avoiding this bespoke commenting pattern and instead using docstrings inside the fixture definition. Comment applies wherever this pattern popped up.

d33bs · 2026-04-23T16:54:22Z

+# FIXTURE: Small 3D image profile
+# ============================================================================
+@pytest.fixture
+def small_image_profile() -> TestProfile:


Consider using pytest's parametrize to collapse the code needed for repeated iteration over similar data. Comment applies to many tests in this PR.
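A minimal sketch of the pattern, assuming pytest is installed. The function `count_voxels` and the three cases are hypothetical stand-ins for the per-profile fixtures in this PR.

```python
import pytest


def count_voxels(shape: tuple[int, int, int]) -> int:
    """Hypothetical function under test: voxels in a (z, y, x) volume."""
    z, y, x = shape
    return z * y * x


@pytest.mark.parametrize(
    ("shape", "expected"),
    [
        pytest.param((1, 1, 1), 1, id="minimal"),
        pytest.param((4, 8, 8), 256, id="small"),
        pytest.param((16, 32, 32), 16384, id="medium"),
    ],
)
def test_count_voxels(shape: tuple[int, int, int], expected: int) -> None:
    """Each case runs as its own test, replacing one fixture per profile."""
    assert count_voxels(shape) == expected
```

The `id` values show up in pytest's output, so a failing case is still easy to identify even though the test body is written once. The docstring inside the test also takes over the role of the banner comments.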

d33bs · 2026-04-23T16:59:23Z

@@ -1,4 +1,4 @@
-"""Tests for package export ergonomics."""
+r"""Tests for package export ergonomics."""


The r here looks like a typo. Consider removing.
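For context, the `r` prefix marks a raw string literal, so the line is valid Python; it simply has no effect on a docstring that contains no backslashes. Some docstring linters (for example pydocstyle's D301 rule) ask for the prefix only when a backslash is present. A short check, as a sketch:

```python
plain = """Tests for package export ergonomics."""
raw = r"""Tests for package export ergonomics."""

# With no backslashes in the text, the two literals are the same string.
assert plain == raw

# The prefix only matters once a backslash appears:
# "\n" is one newline character, r"\n" is a backslash followed by "n".
assert len("\n") == 1
assert len(r"\n") == 2
```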

d33bs · 2026-04-23T17:00:27Z

  "fire>=0.7.1",
  "jinja2>=3.1.6",
  "pandas>=3.0.2",
+  "pyarrow>=23.0.1",


I couldn't tell exactly, but is pyarrow needed for this PR?

adding the API contracts

c0cede1

MikeLippincott requested a review from Copilot April 21, 2026 15:24

Copilot started reviewing on behalf of MikeLippincott April 21, 2026 15:24

MikeLippincott requested a review from d33bs April 21, 2026 15:37

Copilot AI reviewed Apr 21, 2026

d33bs requested changes Apr 23, 2026

d33bs mentioned this pull request Apr 23, 2026: add loaders #9 (open, 11 tasks)

MikeLippincott wants to merge 1 commit into WayScience:main from MikeLippincott:data_contracts
